Integrative Data Mining Highlights Candidate Genes for Monogenic Myopathies

نویسندگان

  • Osorio Abath Neto
  • Olivier Tassy
  • Valérie Biancalana
  • Edmar Zanoteli
  • Olivier Pourquié
  • Jocelyn Laporte
چکیده

Inherited myopathies are a heterogeneous group of disabling disorders with still barely understood pathological mechanisms. Around 40% of afflicted patients remain without a molecular diagnosis after exclusion of known genes. The advent of high-throughput sequencing has opened avenues to the discovery of new implicated genes, but a working list of prioritized candidate genes is necessary to deal with the complexity of analyzing large-scale sequencing data. Here we used an integrative data mining strategy to analyze the genetic network linked to myopathies, derive specific signatures for inherited myopathy and related disorders, and identify and rank candidate genes for these groups. Training sets of genes were selected after literature review and used in Manteia, a public web-based data mining system, to extract disease group signatures in the form of enriched descriptor terms, which include functional annotation, human and mouse phenotypes, as well as biological pathways and protein interactions. These specific signatures were then used as an input to mine and rank candidate genes, followed by filtration against skeletal muscle expression and association with known diseases. Signatures and identified candidate genes highlight both potential common pathological mechanisms and allelic disease groups. Recent discoveries of gene associations to diseases, like B3GALNT2, GMPPB and B3GNT1 to congenital muscular dystrophies, were prioritized in the ranked lists, suggesting a posteriori validation of our approach and predictions. We show an example of how the ranked lists can be used to help analyze high-throughput sequencing data to identify candidate genes, and highlight the best candidate genes matching genomic regions linked to myopathies without known causative genes. This strategy can be automatized to generate fresh candidate gene lists, which help cope with database annotation updates as new knowledge is incorporated.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text Mining in Cancer Gene and Pathway Prioritization

Prioritization of cancer implicated genes has received growing attention as an effective way to reduce wet lab cost by computational analysis that ranks candidate genes according to the likelihood that experimental verifications will succeed. A multitude of gene prioritization tools have been developed, each integrating different data sources covering gene sequences, differential expressions, f...

متن کامل

Genes Predisposing to Monogenic, Polygenic, and Syndromic Obesity: A Review of Current Trends and Prospects for Standard Obesity Genetic Testing

Objective: The burden of obesity is currently enormous, necessitating a novel strategy to complement the existing ones. Accordingly, genetic predisposition is suspected in many cases of the disease, which can potentially be used as therapeutic targets. However, there are differing viewpoints on the suspect genes, prompting the current review to articulate the genes and their mechanisms. Eight (...

متن کامل

Idiopathic Calcium Nephrolithiasis And Hypercalciuria: The Role Of Genes

Idiopathic calcium nephrolithiasis and hypercalciuria are multifactorial disease conditions, the pathogenesis of which involves the interaction of environmental and individual factors. Data support a strong role of genes in the pathogenesis of these two conditions. Findings obtained in monogenic disorders characterized by renal calcium stones, and/or hypercalciuria, and/or nephrocalcinosis have...

متن کامل

Mining Disease-Resistance Genes in Roses: Functional and Molecular Characterization of the Rdr1 Locus

The interaction of roses with the leaf spot pathogen Diplocarpon rosae (the cause of black spot on roses) is an interesting pathosystem because it involves a long-lived woody perennial, with life history traits very different from most model plants, and a hemibiotrophic pathogen with moderate levels of gene flow. Here we present data on the molecular structure of the first monogenic dominant re...

متن کامل

New data and features for advanced data mining in Manteia

Manteia is an integrative database available online at http://manteia.igbmc.fr which provides a large array of OMICs data related to the development of the mouse, chicken, zebrafish and human. The system is designed to use different types of data together in order to perform advanced datamining, test hypotheses or provide candidate genes involved in biological processes or responsible for human...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2014